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We study spatial networks constructed by randomly placing nodes on a manifold and joining 
two nodes with an edge whenever their distance is less than a certain cutoff. We derive the general 
expression for the connectivity distribution of such networks as a functional of the distribution 
of the nodes. We show that for regular spatial densities, the corresponding spatial network has 
a connectivity distribution decreasing faster than an exponential. In contrast, we also show that 
scale-free networks with a power law decreasing connectivity distribution are obtained when a certain 
information measure of the node distribution (integral of higher powers of the distribution) diverges. 
We illustrate our results on a simple example for which we present simulation results. Finally, we 
speculate on the role played by the limiting case P(k) cx fc _1 which appears empirically to be relevant 
to spatial networks of biological origin such as the ones constructed from gene expression data. 



I. INTRODUCTION 

In contrast to abstract graphs, many real networks 
are embedded in a metric space: The interactions be- 
tween the nodes depend on their spatial distance and 
usually take place between nearest neighbors. Examples 
of such networks are transportation and communication 
networks, friendship or contact networks [1,2]. An es- 
pecially important example is the Internet [3,4], which 
is a set of routers linked by physical cables with differ- 
ent lengths. Several recent studies have investigated net- 
works whose nodes are embedded in a metric space, and 
where the probability of connecting two nodes with an 
edge depends on their distance [1,2,4-10]. 

On the other hand, the concept of scale-free network 
has emerged in the last few years as a powerful unify- 
ing paradigm in the study of complex systems of natu- 
ral, technological and social origin (see Refs. [11,12]). It 
is therefore natural to investigate the possibility of em- 
bedding scale-free networks in space. In particular in 
Refs. [9,10], the general problem of embedding a scale- 
free network of given connectivity distribution in a Eu- 
clidean lattice was studied. 

In this paper we take a somewhat reversed point of 
view: Our starting point is a spatial distribution of points 
on a continuous manifold M. Such points are the nodes 
of a network built by joining two nodes whenever their 



distance is less than a certain cutoff. In the following we 
will call 'spatial networks' those obtained by this proce- 
dure (sometimes also termed as 'ad hoc networks' [2]). 
We study how the connectivity distribution depends on 
the distribution of the nodes on M. The fact that the 
nodes live in a continuous manifold rather than a lat- 
tice is important in generating scale-free networks since 
the number of neighbors that a node can have within 
a certain distance is not limited a priori by the lattice 
structure. 

Spatial networks originating from a uniform distri- 
bution of the nodes were studied in Ref. [7], where it 
was shown that while the connectivity distribution takes 
the same Poisson form as in the Erdos-Renyi random 
networks (see eg. [13]), other important features, most 
notably the clustering coefficient, are radically different 
from the Erdos-Renyi case. The formation of giant clus- 
ters in such networks was studied in Ref. [7,2]. 

A natural application of this class of networks is the 
study of epidemics propagation. While scale-free net- 
works in which the connectivity does not depend on a 
pre-existing metric structure do not display an epidemic 
threshold [14], the situation changes when geographical 
closeness of two nodes influences their probability to be 
connected [9]. In our networks geographical closeness 
completely determines whether two nodes will be con- 
nected, so that we expect to find an epidemic threshold. 
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Spatial networks appear to be a rather realistic model 
for epidemic propagation in animal or plant populations, 
where we do not expect any individual to have an inter- 
action range very different from the average (as is the 
case for highly mobile human populations), while we do 
expect the population density to be non-uniform. 

In addition, spatial networks of biological origin have 
also recently been constructed, especially from gene ex- 
pression data obtained from micro-array experiments 
[15-17]. The connectivity distribution of such networks 
turns out to be a truncated power-law with the exponent 
of the power-law decay often close to unity. Interestingly, 
also networks constructed from gene expression data by 
a different rule [18], which does not satisfy our defini- 
tion of a spatial network, display a similar connectivity 
distribution. 

These facts motivated us to study how the connectivity 
distribution of a spatial network depends on the distri- 
bution of the nodes in space and under which conditions 
a scale-free network can be obtained, and finally whether 
an exponent close to unity plays any special role in this 
context. 



II. THE CONNECTIVITY DISTRIBUTION OF A 
SPATIAL NETWORK 

A. General Expression 

The N nodes of the network are supposed to be in a 
D-dimensional space and we will assume that they are 
distributed randomly in space with density p(x) . Given 
a node chosen at random, the probability that it is placed 
within a given domain of space m is 

/ dxp(x) (1) 

J m 

(we denote the integration measure by dx independently 
of the dimension D). Once the nodes are distributed in 
this space, we have to construct the edges. We will use a 
simple cut-off rule: Given two nodes i and j, located at 
Xi and Xj respectively, an edge will join them if 

d(xi,Xj) < R , (2) 

where d(x, y) is the distance between x and y. There- 
fore, once the nodes have been distributed the network 
is completely determined by the choice of the cutoff R. 
This model follows strictly the rule used to construct net- 
works based on gene expression data in Refs. [15-17], but 
more generally, it can be used to model the case where 
the interaction has a typical scale given by R. 

Denoting by B R {x) the ball of radius R centered in x 

B R (x) = {yeM :d(x,y)<R} , (3) 



the probability that a given node is placed within B R (x) 
is 

q R (x) = f dx'p(x') . (4) 

JBr(x) 

If we consider a node located at x, the probability that 
it will have k neighbors is then just the probability that 
k additional nodes are located in the B R {x). The con- 
nectivity distribution for a node placed in x is thus given 
by the binomial distribution 

P(k;x,R)= ( iV ^ 1 )teW fc [1 - QR(x)] N - 1 - k ■ (5) 

In the following, we will be concerned with the limit 
N — > oo: In order to obtain a well defined connectivity 
distribution in this limit, one has to take the limit R — > 0, 
so as to ensure that the product 

N V(R) (6) 

where V(R) is the volume of the ball 

V(R) = [ dx , (7) 

JBr(x) 

tends to a finite constant a. This means that R must 
scale as N~ D as TV — > oo, and implies that the expected 
number of nodes found within the ball B R (x) remains 
finite in this limit: 

N q R (x) -> a p(x) . (8) 

The constant a fixes the scale of the average connectivity 
(k) of the network. Indeed, from Eq.(ll) derived below, 
it is easy to obtain the following relation between a and 

(<=) 

Let us note that although the connectivity distribution 
is well-defined for any value a, it has been shown in the 
case of a uniform density [7,2] that the existence of a gi- 
ant connected component implies a minimum value of a 
(or equivalently (k)). We will not address this problem 
here but we can expect that for other densities there will 
be some similar conditions on a for the existence of a 
giant component. 

In the limit N — > oo, R — > 0, the connectivity distri- 
bution for a node located at x is Poissonian and Eq.(5) 
becomes 

P(i;i,a)4aVW^ op(l) • (10) 
k\ 

For the whole space, the connectivity distribution of the 
network is then obtained as the spatial average of the 
former expression and is 
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P(k;a) = ^ j dxp k+1 {x)e 



ctp(x) 



(11) 



This formula solves the general problem of determining 
the connectivity distribution of a spatial network from 
the spatial distribution of the nodes. For a uniform dis- 
tribution p(x) , we recover a Poissonian connectivity dis- 
tribution as in Rcf. [7]. However, Eq. (11) shows that 
other connectivity distributions can be obtained. In the 
following section we will determine under which condition 
a spatial node distribution generates a scale-free network 
instead. 



B. Scale-free spatial networks 

Equation (11) allows us to determine the condition 
that must be satisfied by the node density p(x) for the 
corresponding network to be scale-free. A network is 
scale-free if the moments (k v ) of its connectivity distri- 
bution diverge for v larger than a certain u max . It is easy 
to compute the moments of the connectivity distribution 
of a spatial network from Eq. (11): For integer v we have 



(0«=y« fc 7r/ dxp k+1 (x)e- ap{x) 
k kl Jm 

= E E i S( " l)(k ^ [ dxp k+1 (x)e- a ^ (12) 
= S^a" 1 f dxp m+1 (x) , (13) 

m=0 



where (k) m = k(k — 1) . . . (k — m + 1) and sl" 1 ^ are the 
Stirling numbers of the second kind (see e.g. Ref. [19], p. 
824), defined by 



x v = Si, m) x(x- l)...(x-m + l) . (14) 

It follows that (k v ) a exists if and only if the integrals 

H a = [ dxp a+1 (x) (15) 



exist for all a < v. Conversely, the network is scale-free if 
there exists a ^ max such that H a diverges for all a > ^ max - 
The integral Eq. (15) is a measure of the information con- 
tained in the probability distribution p(x) , and is simply 
related to the Renyi entropy [20] 



R a 



1 



1-q 



log H q -x 



(16) 



C. Classes of spatial networks 

More generally Eq. (11) allows us to determine the 
type of spatial network obtained for a given spatial node 
distribution. The networks can essentially be distin- 
guished by the decay of the connectivity distribution for 
large connectivities [23]. For spatial networks, large con- 
nectivities are obtained in high density regions, namely 
maxima of p(x). For the sake of simplicity, we will limit 
the discussion to the case of an isotropic distribution 
p(x) = p(r) where r is the modulus of x. We will also 
suppose that we have one density maximum located at 
r = 0. We will then distinguish the following two cases: 

• p(0) = po is finite and decays sufficiently rapidly so 
that all the quantities H a are finite. This could be 
for instance the case for population density which 
is decreasing exponentially from the city center 
[21,22]. In this case, an asymptotic evaluation of 
the integral in Eq. (11) shows that for large k the 
connectivity distribution decays as 



P(k;a) 



{apo) k 
klk D 



(17) 



As expected, the low density fluctuations are re- 
flected in the fast decay of the connectivity distri- 
bution and the corresponding spatial network will 
be of the 'exponential' type [23] (ie. the connec- 
tivity distribution decreases at least as fast as an 
exponential). 

• p(r - 0) - r~ with < D (if > D a cut-off 
is needed to normalize p(r) and we are in the first 
situation where the maximum of p(r) is finite) . In 
this case, the information measure H a will diverge 
for a > § — 1 and the connectivity distribution is 
a power-law: Its large-A; behavior is given by (see 
section III) 



P{k-a) - k~ D l 



(18) 



The large density fluctuations allow here for the ex- 
istence of nodes with very large connectivities and 
the corresponding network is scale-free. 

Even if it might appear unlikely that spatial densities 
behave pathologically around some points, this is actu- 
ally the case in many instances where the nodes live in 
an abstract space, with a distance defined by correla- 
tions. Such examples are obtained in the case of gene 
expression networks [15-18] (or the stock market [24]) 
for which the distance is defined in terms of Pearson cor- 
relation coefficient between nodes. The spatial network 
has then a simple meaning: Two nodes are connected if 
their correlation is high enough. It has been observed 
that in this example, the connectivity distribution is a 
truncated power-law with exponent of order 1 [15-17]. 
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D. The dependence on the average connectivity 

In this section we consider the dependence of the con- 
nectivity distribution (11) of a spatial network on the 
parameter a that, as discussed in Sec. 2, is a measure of 
the average connectivity. From Eq.(ll) one immediately 
obtains the following equation governing the dependence 
of P(k; a) on k: 

dP(k; a) = 1 a )-{k + 1) P(k + 1; a)} . (19) 

da a 

Note that while all spatial networks obey this equation, 
the converse is not true: One can easily construct con- 
nectivity distributions P(k; a) satisfying Eq. (19) that 
cannot be obtained from a spatial network. 

Inspection of Eq. (19) shows that the limiting case 
P(k) oc k^ 1 corresponds to a fixed point, where the con- 
nectivity distribution does not depend on a. This ob- 
servation might be relevant in explaining the common 
appearance of scale-free networks with connectivity ex- 
ponent ~ 1 constructed as spatial networks from gene 
expression data [15-17]. 



III. A SIMPLE EXAMPLE OF A SCALE-FREE 
SPATIAL NETWORK 

In this section we discuss an explicit example of a one- 
dimensional distribution of nodes that generates a scale- 
free network. We calculate exactly the connectivity dis- 
tribution in the limit of infinite number of nodes using 
Eq. (11). We also address the issue of finite-size effects 
and we compare the exact result for infinite size to nu- 
merical simulation for finite networks. 



A. Connectivity distribution for an infinite network 

We consider the space to be the open interval (0, 1) 
and the node distribution to be 

p(x) = (1 - (3) (3<1 . (20) 

The information measure H a is given by 



H a = (1 -f3) a+1 / dxx- fi{a+ ^ 



and diverges for 



a>—l 



(21) 



(22) 



Therefore we expect a scale-free network with (k v ) a di- 
vergent for v > jj — 1. 

The connectivity distribution can be computed explic- 
itly from Eq. (11) and is 



a 



P(A;;a ) = ^_ ( l_ / 3 ) fe+i 



f dxx-^+^e-^ 1 -^^ (23) 
Jo 

r fc + l-I,a(l-/3) 



r 



r(fc + i) 

where r(x, a) is the incomplete Gamma function 



T(x,a) 



dtt^e-* 



Since for 



r(z, a) 



, (24) 



(25) 



(26) 



we have for k — > oo 



1 



P(fc;a) „_|_[ a (l _/?)]!//». 

ap 



( fc + 1 -^) 
T(k + 1) 



J-[a(l-(3)]^k-^ . 
ap 



(27) 
(28) 



This result shows explicitly that scale-free networks with 
any value of the connectivity exponent down to 1 can in- 
deed be obtained as a spatial networks, as it is observed 
in gene expression networks. Finally, we remark that 
the transition to a power-law behavior of the connectiv- 
ity distribution (27) depends on a and [3. Indeed, when 
the second argument a(l — (3) of the incomplete gamma 
function tends to zero, we have a power-law behavior for 
P(k;a) even at small k, as can be seen from Fig. (1). 



B. Finite-size effects: Numerical results 

Since real spatial networks contain a finite number of 
nodes and our analytical results were obtained in the 
limit of infinite networks, it is important to address the 
issue of finite size effects. In this section, we approach 
the problem from a numerical point of view by construct- 
ing finite spatial networks by Monte Carlo methods and 
comparing their connectivity distribution with the theo- 
retical predictions. 

The figure 1 shows the result of such comparison for 
the one-dimensional spatial networks studied in the pre- 
vious subsection, at (3 = 0.5 and (3 — 0.9, and a = 5. For 
a finite network, a is naturally defined as 2NR, where N 
is the number of nodes and R is the distance cutoff used 
to define links. In each part of the figure, the connec- 
tivity distributions for N = 1000 and N = 20000 points 
are superimposed to the theoretical distribution given by 
Eq. (24). We can conclude that already for moderately 
sized networks the connectivity distribution is very close 
to the N — > oo behavior Eq. (24), thus indicating that 
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our results provide a good approximation to the connec- 
tivity distribution of finite spatial networks. 
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FIG. 1. Monte Carlo simulation of the spatial networks 
constructed from the one-dimensional node distribution (20) 
compared to the N — > oo limit Eq.(24). The two figures refer 
respectively to /3 = 0.5 (0.9) with a = 5, and show for each 
case the result for N = 1000 and N = 20000. 



IV. DISCUSSION 

We have presented a systematic analysis of the con- 
nectivity distribution of spatial networks constructed by 
joining nodes closer to each other than a cutoff distance, 
in the limit in which the number of nodes tends to infin- 
ity. The main results of our analysis can be summarized 
as follows. First, the connectivity distribution can be ex- 
pressed as a functional of the spatial distribution of the 
nodes, Eq. (11). The moments of the connectivity distri- 
bution are related to a certain measure of information H a 
of the node distribution through Eq. (13). In particular, 
scale-free networks arise from this construction whenever 
the information measure H a diverges for some a and we 
showed that scale-free networks with any exponent 7 > 1 
can be constructed as spatial networks. Our results were 
obtained in the limit of infinite number N of nodes but 
numerical results suggest that they provide an excellent 
approximation of real, finite spatial networks already for 
moderate N. Finally, the analysis of the dependence of 
the connectivity distribution P(k; a) on the connectivity 
scale a suggests that the limiting case P(k) oc fc _1 cor- 
responds to a fixed point, a fact that might be related to 
the empirical observation that several spatial networks 
constructed from gene expression data show a scale-free 
connectivity distribution with exponent close to 1. 

Intuition suggests that spatial networks will be highly 
clustered and highly assortative. The clustering coeffi- 
cient of spatial networks was studied in Ref. [7] for a 



uniform node distribution: However since the cluster- 
ing coefficient is a local quantity and does not depend 
on the spatial density of nodes, the results of [7] should 
hold unchanged for all spatial networks, as long as the 
node distribution is isotropic in space. Moreover, spa- 
tial networks can be expected to display a high degree of 
assortativity, since high connectivity nodes are placed in 
high density regions and are therefore more likely to be 
connected to other high connectivity nodes. Indeed em- 
pirical spatial networks constructed from gene expression 
data show a high degree assortativity [16]. A closely re- 
lated issue is the determination of the diameter of spatial 
networks. Due to the embedding in a metric space, we 
expect the diameter to grow as a power of the number of 
nodes, and therefore we do not expect spatial networks 
to belong to the small-world networks class. 

Perhaps the most interesting open problem is to clarify 
the role of the limit case P(k) oc namely to classify 
the node distributions that flow to this fixed point in 
some limit. An example is given by Eq.(24) in the (3 — > 1 
limit. P(k) oc kr 1 implies that the measure of infor- 
mation H a diverges for all a, which in turn implies that 
the node distribution must become degenerate (that is 
its support must shrink to zero measure). However this 
is not a sufficient condition, since for example a Gaus- 
sian p(x) in the limit of zero variance, while becoming 
degenerate, does not flow to P(k) oc fe _1 . 
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